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Abstract. A homotopy method for multi-objective optimization that pro¬ 
duces uniformly sampled Pareto fronts by construction is presented. While 
the algorithm is general, of particular interest is application to simulation- 
based engineering optimization problems where economy of function evalua¬ 
tions, smoothness of result, and time-to-solution are critical. The presented 
algorithm achieves an order of magnitude improvement over other geometri¬ 
cally motivated methods, like Normal Boundary Intersection and Normal Con¬ 
straint, with respect to solution evenness for similar computational expense. 
Furthermore, the resulting uniformity of solutions extends even to more dif¬ 
ficult problems, such as those appearing in common Evolutionary Algorithm 
test cases. 


1. Introduction 


The problem of scalar function minimization is ubiquitous in modern engineering 
sciences and a number of algorithms exist that exploit various functional character¬ 
istics to reach timely solutions. Far more difficult is the problem of vector function 
minimization, where a of number different, often conflicting, objectives are opti¬ 
mized in such a way as to strike a balance that pleases the end-user. The presence 
of multiple output dimensions requires a generalization of our concept of optimality, 
demands significantly more computational effort, as well as compounds the inherent 
difficulty of the problem. 

1.1. Problem Statement. Precisely stated, and adopting the notation of El , we 
seek to address the problem 


minF(x) = [Fi(x), F 2 (x),..., F fc (x)] r 

xG.D 


(i) 


subject to 

5 j(x) <0 Vje[l,m] 
hi (x) =0 VZ e [1, e] 


where the objective functions, Fj, are bounded and defined over the set D £ K". 
Here, x is a vector of independent design variables of length n and contained in D. 
In this formulation, the set of objective functions, {Fj;(x ): \/i £ [1, fc]}, form a map 


k 


( 2 ) 


F:R"a1' 


from design space to objective space. The set of all x G D satisfying the inequality 
constraints, <?j(x) < 0, and the equality constraints, Zq(x) = 0, 


( 3 ) 



is called the feasible set. It’s image in objective space 
(4) Z-. = {F(x): x £ X} 


is denoted the attainable set. 


l 
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In the general case, the F, functions can not be simultaneously optimized and 
compromise solutions must be considered. Therefore, our goal is to discover locally 
Pareto optimal points, which are solutions that can not be improved in all objectives 
simultaneously when compared to its neighbors in the feasible set. The set of 
all Pareto optimal points (when viewed in objective space) is called the Pareto 
front and explicitly maps the optimal values that can be attained when considering 
conflicting objectives. 

In this paper we propose a method for computing high-quality discrete approxi¬ 
mations of Pareto fronts. We begin by reviewing existing methods and commenting 
on their ammenability to large problems with many objectives. We then review 
previous work on a particular homotopy method and demonstrate an alternative 
formulation. After explicitly generalizing this approach to problems arbitary di¬ 
mension, we compare our formulation to methods using three test problems from 
the literature. Finally, we present an application of this method to a problem in 
the field of particle accelerator design and discuss performance related to various 
benchmarks. We conclude by mentioning futher improvements and future research 
directions. 


2. Related Work 

As a result of the ubiquity of problems that fit the formulation given in 0 , a 
variety of methods have been devised for sampling high-dimensional Pareto fronts. 

2.1. Scalarization. One simple method involves explicitly imposing an order on 
the objective space. This can be done, for example, by introducing a suitable scalar¬ 
ization function projecting the objective space onto, and using the well-ordered 
property of, the real numbers to define a solution. This effectively reduces the task 
of ([I]) to a scalar minimization problem amenable to solution via an number of 
established numerical optimization techniques. 

(5) min x£ x s(F(x); A) with s(F; A): R fc —» R 

While the approach is straightforward, the mapping between the scalarization pa¬ 
rameters (A) and corresponding solution’s location on the Pareto front generally is 
not, and fronts generated by blindly probing the A-space often suffer from highly 
irregular sampling ED- 

2.2. Evolutionary Algorithms. Rather than attempting to break the overall 
sampling task into a series of subproblems, as in the proposed scalarization ap¬ 
proach, population-based methods, generally speaking, attempt to solve the prob¬ 
lem as a whole. This is done, typically, by generating sets of mutually non- 
dominated solutions using Evolutionary Algorithms (EAs). A number of specific 
heuristics have been proposed (see m , ED, EL and ESI) using a variety of tech¬ 
niques to balance the discovery of non-dominated solutions with diversity criteria 
to discourage solution crowding. 

While EAs are capable of performing well on even the most difficult problems, 
some key aspects suggest that they are not ideal for all objective types. First, as 
in [2j, solutions are evaluated based on their relative dominance to other solutions 
in the population. Unlike scalarization methods that provide some necessary or 
sufficient conditions for optimality of solutions, this does not actually guarantee 
Pareto optimality of the resulting solution candidates. Additionally, the nature 
of the non-dominating fitness criteria inherently limits the scalability of this class 
of algorithms as it involves comparing elements of the population against each 
other, naturally implying some super-linear running time behavior with increasing 
population size 0 0. While parallelizing the evaluation step [5] or distributing 
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various objective space regions to different processors [ 1 ] can partially mitigate 
this effect, population size must still increase exponentially with objective space 
dimension to maintain an evenly sampled Pareto front. Furthermore, increasing 
the number of objectives distorts the utility of the dominance ranking criterion as 
the hypersurface-area to hypervolume effect permits a higher proportion of non- 
dominated solutions and weakens selection pressure [?). 

2.3. Geometric Methods. Rather than rely on a flood of function evaluations to 
explore the solution space, a final class of methods uses the scalarization approach, 
but exploits geometrical arguments to further restrict the attainable set for each 
optimization subproblem. Practically, this is accomplished by adding auxiliary 
constraint functions to steer the scalar optimization subroutines towards desirable 
solutions and deliver an evenly sampled front. The Normal Boundary Intersection 
(nbi) m and the Normal Constraint (NC) [I2J method essentially restrict each 
optimization subproblem to consider only solutions that lie on the normal vector 
emanating from the convex hull of the individual minimizers (CHIM). 

These methods make intuitive sense, as well as produce well sampled fronts. 
However, by restricting the attainable set to the CHIM normal, these approaches 
are only capable of capturing Pareto points that lie within the projection of this 
hull. Furthermore, these points are only equally distributed when viewed from the 
CHIM frame, as seen in figure [l] 

fi 



fi 


Figure 1. In the NBI and NC methods, evenly spaced points on 
the CHIM are projected to the Pareto front. Unfortunately, as a 
result of the front geometry, the resulting sample points are not 
always evenly spaced. The fact that d\ 7 ^ ^2 illustrates this effect. 

The specific algorithms also exhibit some scalability bottlenecks, especially when 
increasing the number of objectives, k. Since the boundaries of the Pareto front 
are not know a priori, it is often difficult to decide where to place solution points 
on the CHIM. In P3j and [TJ], it is suggested to divide the solution algorithm into 
two phases; the first samples the trade-offs between all combinations of individual 
objectives to establish the boundaries of the complete front. The second phase 
involves solving a set of scalar optimization subproblems to find Pareto optimal 
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points on the interior of this boundary. The first phase, consists of YlnZl (^A 1 ) 
“sub-fronts” to sample before the domain is properly bounded and sampling on the 
interior can begin. Furthermore, the recursive nature of sub-front sampling, such 
as this, limits the scalability to only i parallel sub-front’s per step of the initial 
phase. 


3. Motivating Example 

Consider a standard bi-objective minimization problem. 

( 6 ) minF(x) = [F 1 (x),F 2 (x)] T 

x£A 

A straightforward method for constructing a well-sampled discrete Pareto front 
representation, as discussed in El and m, involves scalarizing the objectives and 
adding explicit solution spacing terms to the equality constraint set. 

Using a simple weighted sum scalarization, we obtain the following scalar objec¬ 
tive function, 

(7) /(x, A) = (1 — A)Fi(x) + AF 2 (x) with 0 < A < 1 

the minimizer of which, min xg x fix, A) = f(A), is a parametric expression of the 
Pareto front. The endpoints, f(0) and f(l) correspond to the minima of the indi¬ 
vidual objective functions, -Fi(x) and F 2 (x), respectively, while the parameter A 
controls the transition between these extremes. 

The mapping between A and position along the Pareto front is often highly 
nonlinear. However, as presented in [1'8'j . we can exploit this additional degree of 
freedom to sample specific points along the curve by treating the A parameter as 
a design variable. This added flexibility is complemented by an auxilary equality 
constraint, typically of the form 

( 8 ) 11F (x) — F prev \\ 2 = y 2 

that explicitly enforces equal spacing between adjacent sample points. This allows 
a suitable scalar optimization procedure to minimize the scalarized objective func¬ 
tion ([7j) subject to the constraint that the result lies a certain distance, 7 , from 
a specified point, F prev . By seeding this procedure with either of the objective 
minima, for instance with F(xo) = f(0), we can iteratively generate evenly spaced 
samples by “marching” along the Pareto front carrying out the following scalarized 
minimization 


(9) 

min 

x p ex 

(1 - Ap)Fj(xp) + A p F 2 (x p ) with 0 < A n < 1 

( 10 ) 

subject to 

F(x p ) — F(x p _ 1 ) 2 = y 2 


for p up to the number of desired sample points, P (provided a suitable step-size, 
7 )- 

While this method produces well-sampled Pareto front representations, even for 
difficult problems, it requires sequential variation of the A-parameter and serial 
computation of the sample point locations, thereby limiting its scalability and ap¬ 
plication to large problems and those with expensive objective functions. 

The authors address this concern in [18] by extending the algorithm to paral¬ 
lel processing environments. This is done, primarily, by replacing references to 
previously computed sample points, F(x p _!), with continually updated position 
estimates, F(x p “ T 1 r ' eTlt ). This way, we create a set of semi-independent and simulta¬ 
neously solvable scalar optimization sub-problems, coupled only through the aux¬ 
ilary equispacing constraint ([ 8 |. In fact, we can distribute the subproblems to as 
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many processors as we have Pareto front sample points, thereby lifting the seal- 
ability limits imposed by the sequential algorithm. This iterative refinement, of 
course, requires defining a set of initial estimates, {F p £ [1,P]}, that form an 
ansatz front; however, this can be as simple as a set of equally spaced points on the 
line connecting the objective space images of the individual function minimizers. 


One concern with enforcing an explicit spacing constraint like (10 1 is that, with¬ 


out additional information about the arc-length of the true Pareto front, a suitable 
step-size parameter, 7 , can not be established a priori. One can imagine a boot¬ 
strapping process whereby the total arc-length of the front is estimated and refined 
by successive applications of the algorithm; however, this would involve serial rep¬ 
etitions of the front sampling procedure, limiting parallel scalability. 


A simpler approach that accomplishes the same goal is to adjust constraint (10) 


to, instead, explicitly enforce equal spacing between a point’s up- and down-stream 
neighbors. Specifically, a modified constraint of the form 


( 11 ) 


\F p - F 


p+i| 


- ||F P - F p _i| 


= 0 


eliminates the need to estimate the Pareto front arc-length a priori by not fixing the 
spacing between points and allowing the front representation to expand indefinitely. 


/2 



h 



Figure 2. The solid circle in the plot on the left represents the fea¬ 
sible region permitted by constraint (101 for ansatz node A , given 
unity a , while the solid line on the right represents the feasible 
region implied by constraint (11) for ansatz node B. 


Rephrasing the constraint this way, however, has other important ramifications, 
as well. First, it ameliorates an unfortunate consequence of the formulation in (10) 
that limits the optimizer’s improvement per iteration to 7 . Because of this limited 
step size, if the ansatz is sufficiently distant from the real Pareto optimal front, 
applying the method of [IS) will require multiple iterations to generate a set of 
truly optimal solutions. Constraint (11), however, is more in line with NBI and 
NC methods (described above) that do not restrict step sizes, allowing the sample 
points to converge to the Pareto front more quickly. 

Most importantly, by eliminating explicit references to the total arc-length, 
only local communication of current F p values between adjacent optimization sub¬ 
problems is required. This means we can concurrently compute uniformly dis¬ 
tributed Pareto front representations using only local, asynchronous communication 
between neighboring compute nodes, a key feature of massively scalable algorithms. 
In fact, the amount of parallelism is bounded only by the number of Pareto front 
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sample points required, P, which (to obtain a truly uniformly spaced representa¬ 
tion) often scales exponentially with the number of objectives. 


4. Multi-Dimensional Generalization 


Generalizing this approach to multiple objectives requires not only auxiliary 
scalarization variables, but a full generalization of the ansatz concept and corre¬ 
sponding constraints as well. The equidistance constraint naturally applies in the 
context of the bi-objective linear ansatz. In fact, during the first iteration, it reduces 
the attainable set in a manner that is functionally equivalent to the constraints used 
in the NBI and NC methods. Similarly confining the attainable set as the dimen¬ 
sion of the problem increases is accomplished by properly balancing the positions of 
neighboring ansatz nodes in fc-space. In addition to establishing this neighbor-wise 
balance, we identified a number of further requirements for any meshing solution: 

I An ansatz mesh must have an approximately equal distribution of points over 
its k — 1 area. 

II It should easily support a range of sampling resolutions. 

III It should scale with problem dimension. 

IV It should support distributed generation with low computational, as well as 
storage, overhead. 


To address these requirements, we supply an ansatz with the simplicity and seal- 
ability of a rectilinear, coordinate-parallel mesh that resides on the convex hull of 
the individual objective minimizers. Since we deal in a normalized objective space, 
we can generate the initial front sample points by intersecting a rectilinear grid with 
the unit k — 1 simplex. We then establish equidistance constraint relationships (or 
adjacency pairs) along the coordinate-parallel mesh axes. For instance, the scalar- 
ized sub-problem at point C will be augmented with two additional constraints, 

(12) and (13), both of which are based on As in the two objective prob¬ 

lem discussed above, this initially reduces the attainable set of this optimization 
subproblem to the simplex normal. 


(12) ||F C —F a || 2 —||F C —F b || 2 = 0 

(13) ||F C —F b || 2 -||F C —F b || 2 =0 


The scalarized subproblem at point D, however, will only be augmented with 
a single additional constraint, (14). This is because point D actually lies on the 
Pareto front of the bi-objective subproblem in the /i x f 2 plane and needs no further 
restriction of the attainable set. 


(14) IlFc-F^H 2 -||F B -Fj|| 2 =0 

The ansatz is completed by reprojecting the result into k -space resulting in a 
uniformly-distributed linear interpolation between the k individual objective min¬ 
imizers and preserving the coordinate-wise equal spacing between the constituent 
points. This process is illustrated for a three objective problem in figure [3] with the 
final result projected back into 3-space in figure [4] 

This ansatz concept addresses requirements[l]and[lT]by creating a uniform tensor- 
product mesh and supporting a range of axis-wise sampling densities. In this way, 
we avoid uneven sampling of the Pareto front and allow the user to explicitly specify 
the level of detail required. 

Moreover, the coordinate-parallel nature of this solution addresses characteristic 


III by adding only two more mesh point neighbors per additional dimension of 
the ansatz. In other words, the degree of each node in the graph is bounded 
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Figure 3. On the left, we generate the ansatz points by inter¬ 
secting a rectilinear grid with the convex hull of the individual 
minimizers, denoted by the shaded region (a k — 1 simplex). We 
establish neighbor-relations on the right using coordinate-parallel 
axes. 


/j 



Figure 4. The full ansatz for a three objective problem as seen 
in the /i x / 2 x f 3 space. 


by 2 (k — 1) while adequately reducing the attainable set. Ultimately, this means 
that the number of additional constraints added to the overarching optimization 
problem to ensure equispacing of the Pareto front, as well as the magnitude of the 
data dependency of the optimization sub-problem occurring at each mesh node, 
scales linearly with the number of target objectives. 

Finally, since all of the coordinate transforms and neighbor-discovering oper¬ 
ations can be accomplished independently for each point, this meshing strategy 
supports fully distributed generation, satisfying requirement m 

With a scalable ansatz concept in place, the fully generalized approach boils 
down to solving a scalar minimization problem, given by (15), for each Pareto front 
sample point, p. Here, an adjacency pair, a p , consists of a set of per-axis opposing 
neighbors of point p with a l p and a r p denoting the mesh indicies of the left and right 
neigbors, respectively. A(p) represents the set of all adjacency pairs of ansatz mesh 
point p. 











ADELMANN, ARBENZ, FOSTER, AND INEICHEN 


fl 



Figure 5. A similarly constructed ansatz for a four objective 
problem, projected onto the A x / 2 x / 3 subspace. 


min Ai Fj (x) 

xeX.A 

+ A 2 F 2 (x) + ... + AfcF fc (x) with 0 < A^ < 1 

subject to 


tb( x ) 

< 0 Vj £ [1 ,m] 

M x ) 

= 0 VZ€[l,e] 

k 

Xi 

i —i 

= 1, Vi £ [1, k] 

l|F(x) 

F a «|| 2 H|F(x)-F a; || 2 =0 Vo p £ A{p) 


5. Results 


5.1. Metrics. Having described the new algorithm (15), we turn to evaluating its 
performance relative to the other methods described. We discuss a set of metrics for 
measuring front accuracy as well as quality (with a focus on evenly sampled repre¬ 
sentations) and compute them for a set of multi-objective optimization benchmark 
problems taken from the literature. 


5.2. Hypervolume. A useful and, more importantly, scalable metric for measur¬ 
ing the accuracy and quality of a Pareto front approximation is the hypervolume 
of the dominated space. Effectively, given a point in the attainable set, we measure 
the k dimensional volume of the dominated space, relative to some “worst-case” 
point, as shown in figure [ 6 ] 

The particular convenience of this metric lies in the fact that the limit of a 
perfectly sampled Pareto front will converge to a concrete value, representing the 
total dominated volume. Given the fact that the true Pareto optimal front is the 
maximal set of the attainable region poset, with this metric, we are able to directly 
measure convergence to the true Pareto front. We employed the implementation 
provided by |T5j for efficient computation of hypervolumes. 

5.3. Evenness. Beyond simply converging to the Pareto optimal front, we aim to 
ensure a uniformly sampled approximation in an effort to maximize both economy of 
function evaluations as well as permit useful interpretation of results. A number of 
methods have been proposed to capture the essence of uniformity, from normalized 
root mean square (RMS) distances between adjacent points [d_Zi to the maximum 
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lower bound of all inter-point distances however, these methods do not gen¬ 
eralize to varying objective scales and are not well adopted by the community at 
large. In [12] , the authors recommend a measure of “evenness” that corresponds to 
the intuitive notion that no region of the Pareto front is over- or under-represented 
by the discrete approximation. Given a point Fi in the approximation set, we first 
consider the nearest neighbor distance, d \, to another point in the set, as in figure 
[6] Next, we consider the largest sphere, containing no points from the set, that 
can be constructed such that Fi and another point, Fj , both lie on the surface and 
denote the diameter, d“. Then, given the set 

(16) d: ={4,d?: Vie [1,5]} 
we define the evenness, E, as 

(17) E = a d /d 

the ratio of this set’s standard deviation, a dl to its mean, d, implying that a perfectly 
uniform distribtion has E = 0. This metric generalizes to multiple dimensions quite 
easily by considering hyperspheres when determining the d l and d u values. 


h 


fi 


- - hj Non-Dominated Space 


B 


c; 

Dominated Spalce 


fi 


• Fi 


‘74 \ 


1 ' 

1 

\ 

l 1 

i \ jii, 


sdi | 

\ - 

i \ ' 
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fi 


Figure 6 . The hypervolume of a Pareto front is the total volume 
of the dominated space, as illustrated on the left. Computed with 
respect to the origin, this equals the volume of the union of sets 
A, B, and C. With regard to the evenness metric, the plot on the 
right demonstrates the construction of d\ and d“ for a point E. 


5.4. Function Evaluations &: Time-to-solution. Finally, while the previous 
two metrics quantify front accuracy and quality, we aim to measure the amount of 
computational effort required to obtain the given solutions. To directly compare 
algorithms, independent of specific implementation or computing platform, we re¬ 
port the number of objective function evaluations per Pareto point used to generate 
front approximations for the synthetic test problems. 

5.5. Test Problems. Since the homotopy method presented here primarily relies 
on augmenting the constraint functions, h(x), with additional equispacing con¬ 
straints, we chose to directly compare the accuracy and quality of the Pareto front 
approximation with those produced by similar methods, like NBI and NC. There¬ 
fore, we selected a set of test problems taken from the recent literature m and 
where some of the above metrics are reported for a variety of scalarization and 
geometric methods. 
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5.6. Problem 1. The first problem is relatively easy, consisting of linear functions 
and convex constraints. 

(18) min/j(x) = Xj for i = 1, 2 ,3 

X 

subject to 

— x\ + 1 + £3 1 < 0 

Xj" 1 — X2 + Xg 1 < 0 

xj" 1 TxjT 1 -x 3 < 0 

0.2 < Xi < 10 

Using the simplical CHIM 120-point ansatz, we obtained a smooth, connected 
Pareto front (figure [7| covering most of the first octant using four iterations of our 
algorithm and a few thousand function evaluations. 



Figure 7. Pareto front for problem (18) consisting of 120 points 
obtained with 6,264 function evaluations. 


We computed evenness values, as well as state the number of function evaluations 
required per Pareto point, and compare them to reported values from the literature 
in table [T| 


5.7. Problem 2. The other problem is an extension of (18) to four objectives. 


(19) min/.j(x) = x, for i = 1, 2, 3,4 

X 

subject to 

—xi + xjT 1 + x^ 1 + X4 1 < 0 
xj" 1 - x 2 + Xg 1 + xj 1 < 0 
xq 1 + xi 1 — x’3 + xj 1 < 0 

(20) xj" 1 + x-j" 1 + Xg 1 — X4 < 0 

0.2 < Xi < 10 

Likewise, with a simplicial CHIM 220-point ansatz, we obtained a smooth, con¬ 
nected Pareto front (figure [8]) using two iterations of our algorithm. 
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Table 1. Problem (181 Results, A comparison of solution quality 
(evenness [E]) and computational effort (function evaluations per 
point [FE/Point]) for problem (18). Reference values for WS, 
NBIm, and NCm algorithms were obtained from [13]. It is not clear 
whether the number of function evaluations reported includes those 
induced by finite difference schemes, or if analytical derivatives 
were used in the scalar minimizations. Our results were obtained 
with a first-order finite difference implementation. 


Method E FE/Point 


WS 

4.331 

120.2 

NBIm 

0.2958 

34.3 

NCm 

0.2958 

34.4 

Our new algorithm (15) 

0.01930 

52.2 




ft 
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Figure 8 . Pareto front for problem (19) consisting of 220 points 
obtained with 10,670 function evaluations. The value of the fourth 
objective for each point is shown using the red-blue color-map. 


As before, we computed evenness values, and tracked the number of function 
evaluations required per Pareto point. Table [2] compares the results to reported 
values from the literature. 


5.8. Problem 3. On the other hand, the EA community tends to focus on differ¬ 
ent kinds of problems. EAs, being driven by random generation and recombination 
of solutions, are typically useful in situations where derivative information is un¬ 
available or unhelpful. As a result, test problem suites intended for EAs employ 
highly non-convex, rapidly oscillating, and extremely discontinuous objectives that 
are often inappropriate for constraint-based methods, like the one presented here. 
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Table 2. Problem (191 Results A comparison of solution quality 
(evenness [E]) and computational effort (function evaluations per 
point [FE/Point]) for problem (19). Reference values for WS, 
NBIm, and NCm algorithms were obtained from (131. 


Method 

E 

FE/Point 

WS 

4.836 

231.9 

NBIm 

0.3262 

49.3 

NCm 

0.3072 

55.3 

Our new algorithm (151 

0.02091 

48.5 


Therefore, we limit our discussion to a representative example as the primary moti¬ 
vation is to demonstrate the precise equispacing of produced solutions and economy 
of function evaluations. 


(21) a; min 

X 

subject to 
/i(x) 
f 2 (x) 
/s(x) 


fi (x) for i = 1,2,3 


(1 + g(x)) cos(aq7r/2) cos(a;27r/2) 
(1 + g(x)) cos(a;i7r/2) sin(a:27r/2) 
(l + 3(x))sin(x 1 7r/2) 


d 

g(x) = — 0.5) 2 with 0 < Xi < 1 for i = 1, 2 

i -2 


We selected a workable problem, defined in plj ) , from a standard EA test suite 
0 and compare the performance to some other heuristics noted in the literature. 

The resulting Pareto front, as seen if figure [9j is computed using significantly 
fewer function evalutations, obtains a much better hypervolume than any of the 
algorithms compared in either na or m, and displays more regular sampling than 

ra¬ 
in figure [T0l we demonstrate the asymptotic improvement in both hypervolume 
(computed with respect to the origin) and evenness. This implies that, for certain 
simple front topologies, algorithm 11 5 1 can be terminated after only a few iterations. 
Moreover, figure [To| shows that further iterations of the algorithm tend to improve 
the spacing of points more than the obtained hypervolume. This is a result of 
directly running a set of scalar optimizations (allowing samples to converge to the 
true optimal front more quickly) rather than ranking and recombining randomly 
generated solutions. 


6. Conclusions 


Here, an new algorithm (151 for multi-objective optimization that produces uni¬ 
formly sampled Pareto fronts by construction is presented. While the algorithm is 
general, it is most suitable for application to simulation-based engineering optimiza¬ 
tion problems where economy of function evaluations and smoothness of result are 
critical. The algorithm discussed achieves an order of magnitude improvement over 
other geometrically motivated methods, like Normal Boundary Intersection and 
Normal Constraint, with respect to solution evenness for similar computational 












Hypervolume 


A HOMOTOPY METHOD FOR LARGE-SCALE MULTI-OBJECTIVE OPTIMIZATION 13 




Figure 9. Pareto front for problem (21) consisting of 120 points 
obtained with 15,765 function evaluations. The finite differences 
scheme coupled with the fact that the problem has 10 independent 
variables gives rise to the very high number of function evaluations 
required. For examples of typical fronts generated by NSGA-II and 
SPEA2, see 0. 



Function Evaluations 


Figure 10. Solution quality metrics for Pareto front for prob¬ 
lem (21 1 . Both the solution hypervolume and evenness improve 
asymptotically as the number of function evaluations increase. In 
this example, one iteration of algorithm [15] corresponds to about 
3,000 function evaluations. The front hypervolume is computed 
with respect to the origin. 
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expense. This benefit of the proposed method remains, and even improves, af¬ 
ter scaling the number of dimensions (and therefore the difficulty of the problem). 
Furthermore, the resulting uniformity of solutions extends even to more difficult 
problems, such as those appearing in common EA test cases. 

While the resulting discrete representation of the Pareto front, and computa¬ 
tional expense of achieving it, are both improved, an other important aspect of 
the proposed method is its amenability to parallelization. We will report on the 
parallel aspects together with real world problems [6] in a forthcoming paper. 
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